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Abstract 

We study the problem of 2-dimensional orthogonal range counting with additive error. Given 
■ a set P of n points drawn from an n x n grid and an error parameter s, the goal is to build a 

data structure, such that for any orthogonal range i?, the data structure can return the number 
of points in P (~l R with additive error en. A well-known solution for this problem is the e- 
^vq , approximation. Informally speaking, an e-approximation of P is a subset A C P that allows us 

to estimate the number of points in P n R by counting the number of points in A n R. It is 
known that an e-approximation of size 0(~ log 2 ' 5 i) exists for any P with respect to orthogonal 
ranges, and the best lower bound is f2(- log -). 

The e-approximation is a rather restricted data structure, as we are not allowed to store any 



information other than the coordinates of a subset of points in P. In this paper, we explore 
what can be achieved without any restriction on the data structure. We first describe a data 



structure that uses 0(| log | log log | logn) bits that answers queries with error en. We then 
prove a lower bound that any data structure that answers queries with error O (log ri) must 
use Q(n\ogn) bits. This lower bound has two consequences: 1) answering queries with error 
0(log n) is as hard as answering the queries exactly; and 2) our upper bound cannot be improved 
£T) ■ in general by more than an O (log log - ) factor. 
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O ; 1 Introduction 



Range counting is one of the most fundamental problems in computational geometry and data 
structures. Given n points in d dimensions, the goal is to preprocess the points into a data structure, 
such that the number of points in a query range can be returned. Range counting has been studied 
intensively, and a lot of work has focused on the space-query time tradeoff or the update-query 
tradeoff of data structures. We refer the reader to the survey by Agarwal and Erickson [1] for 
these results. In this paper, we look at the problem from a data summarization/compression point 
of view: What is the minimum amount of space that is needed to encode all the range counts 
approximately? Approximation is necessary here, since otherwise we will have to remember the 
entire the point set. It is also easy to see that relative approximation will not help either, as it 
requires us to differentiate between empty ranges and those containing only one point. Thus, we 
aim at an absolute error guarantee. As we will be dealing with bit-level space complexity, it is 
convenient to focus on an integer grid. More formally, we are given a set of n points P drawn from 
an n x n grid and an error parameter e . The goal is to build a data structure, such that for any 
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orthogonal range R, the data structure can return the number of points in P n R with additive 
error en. 

We should mention that there is another notion of approximate range counting that approxi- 
mates the range, i.e., points near the boundary of the range may or may not be counted [3]. Such 
an approximation notion clearly precludes any sublinear-space data structure as well. 



1.1 Previous work 



e-approximations. Summarizing point sets while preserving range counts (approximately) is 
a fundamental problem with applications in numerical integration, statistics, and data mining, 
among many others. The classical solution is to use the e- approximation from discrepancy theory. 
Consider a range space (P,TZ), where P is a finite point set of size n. A subset iCPis called an 
e-approximation of (P, 1Z) if 

\RnA\ \Rr\P\ 



max 
ReTl 



< e. 



\A\ \P\ 

This means that we can approximate |-RnP| by counting the number of points in Rf)A and scaling 
back, with error at most en. 

Finding e-approximations of small size for various geometric range spaces has been a central 
research topic in computational geometry. Please see the books by Matousek [14] and Chazelle [8] 
for comprehensive coverages on this topic. Here we only review the most relevant results, i.e., when 
the range space 1Z is all orthogonal rectangles in 2 dimensions. This question dates back to Beck 
[5], who showed that there are e-approximations of size 0{j log 4 ^) for any point set P. This was 
later improved to O Qlog 2 ' 5 ^) by Srinivasan [20]. These were not constructive due to the use of 
a non-constructive coloring with combinatorial discrepancy 0(log 2 ' 5 ra) for orthogonal rectangles. 
Recently, Bansal [4] proposed an algorithm to construct such a coloring, and therefore has made 
these results constructive. On the lower bound side, it is known that there are point sets that 
require e-approximations of size log i) [5]. 

Combinatorial discrepancy. For a range space (P, 7Z) and a coloring function x '■ P — > {— 1, +1}, 
we write x(P n R) = ^ pe p n /jX(p)- The combinatorial discrepancy of the range space (P,TZ) is 
defined as 

disc(P, 7Z) = minmax \y(P fl R)\ , 
x Ren 

namely, we are looking at the coloring that minimizes the color difference of any range in 1Z. 
Taking the maximum over all point sets of size n, we say that the combinatorial discrepancy of 1Z 
is disc(n,7£) = max|p| =n disc(P, TZ). 

There is a close relationship between combinatorial discrepancy and e-approximations, as ob- 
served by Beck [5]. For orthogonal ranges, the relationship is particularly simple: The combinatorial 
discrepancy is at most t(n) if and only if there is an e-approximation of size 0(^t(^)). In fact, all the 
aforementioned results on e-approximations follow from the corresponding results on combinatorial 
discrepancy. So the current upper bound on the combinatorial discrepancy of orthogonal rectangles 
in 2 dimensions is 0(log 2 ' 5 n) [20]. The lower bound is O(logn) [5], which follows from the Lebesgue 
discrepancy lower bound (see below). Closing the 0(log L5 n) gap between the upper and the lower 
bound remains a major open problem in discrepancy theory. In d > 3 dimensions, the current best 
upper bound is 0(log d+1 ^ 2 n) by Larsen [13], while the lower bound is Q((lognY d ^ 1 ^ 2+TI ), where 
7] = n(d) is a small constant depending on d [7] . 
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Lebesgue discrepancy. Suppose the points of P are in the unit square [0, l) 2 . The Lebesgue 
discrepancy of (P, TV) is defined to be 

D(P,K) = sup \ \Pr\R\- \Rn [0,1) 2 || . 
Ren 

The Lebesgue discrepancy describes how uniformly the point set P is distributed in [0, l) 2 . Taking 
the infimum over all point sets of size n, we say that the Lebesgue discrepancy of TZ is D(n, TV) = 
mi lPl=n D(P,TV). 

The Lebesgue discrepancy for orthogonal rectangles is known to be 0(logn) in 2 dimensions. 
The lower bound is due to Schmidt [18], while there are many point sets (e.g., the Van der Corput 
sets [21] and the 6-ary nets [19]) that are proved to have O(logn) Lebesgue discrepancy. It is well 
known that the combinatorial discrepancy of a range space cannot be lower than its Lebesgue dis- 
crepancy, so this also gives the ^(log n) lower bound on the combinatorial discrepancy of orthogonal 
rectangles mentioned above. 

Approximate range counting data structures. The e-approximation is a rather restricted 
data structure, as we are not allowed to store any information other than the coordinates of a 
subset of points in P. In this paper, we explore what can be achieved without any restriction on 
the data structure. In 1 dimension, there is nothing better: An e-approximation has size O(^), 
which takes O(-logn) bits. On the other hand, simply consider the case where the n points are 
divided into groups of size en, where all points in each group have the same location. There are 
n l / £ such point sets and the data structure has to differentiate all of them. Thus log(n 1//£ ) = ^ logn 
is a lower bound on the number of bits used by the data structure. 

1.2 Our results 

This paper gives almost matching upper and lower bounds on the space needed by any data structure 
that encodes all the orthogonal range counts with error en. We first describe a data structure that 
uses 0(i log i log log i logn) bits. This is a ©(log 15 ^) (ignoring 0(loglog factors) improvement 
from e-approximations. Of course, we will store some extra information other than the coordinates 
of the points. 

On the lower bound side, we prove that there exists a constant c, such that any data structure 
that answers queries with error clogn must use O(nlogn) bits. This lower bound has two conse- 
quences: 1) answering queries with error O(logn) is as hard as answering the queries exactly; and 
2) our upper bound cannot be improved in general by more than an 0(loglog ^) factor. 

Our lower bound is information-theoretic: We show that there is a collection V* of 2 n (" lo s n ) 
point sets that are pairwise "different enough" in terms of range counts. More precisely, we show 
that the union of any two point sets in V* has high combinatorial discrepancy, i.e., at least clogn. 
Then, for any two point sets P\,P% £ V* , if disc(Pi U P21TZ2) > clogn, where 7^2 is the set of all 
orthogonal rectangles, that means for any coloring x on A U P2, there must exist a rectangle R 
such that |x(-R)| > clogn. Consider the coloring x where x(p) = 1 if p £ Pi and x(p) = — 1 if 
p G P2. Then there exists a rectangle R such that | = ||PnPi| — | i? n P2 1 1 > clogn. This 

implies that a data structure that answers queries with error | log n have to distinguish Pi and P2 . 
To distinguish all the 2 r2 ( nlogn ) point sets in V* , the data structure has to use at least f2(nlogn) 
bits. 

While point sets with low Lebesgue discrepancy or high combinatorial discrepancy have been 
extensively studied, we have constructed a large collection of point sets in which the pairwise 
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union has high combinatorial discrepancy. This particular aspect appears to be novel, and our 
construction could be useful in proving other data structure lower bounds. It may also have 
applications in situations where we need a "diverse" collection of random point sets. 

2 Data Structure 

In this section, we build a data structure that supports approximate range counting queries. Given 
a set of n points on an n x n grid, our data structure uses log ^ log log ^ log n) bits and answers 
an orthogonal range counting query with error en. 

First note that it is sufficient to only consider two-sided ranges. A two-sided range is a rectangle 
of the form [0, x) x [0, y), where (x, y) is called the query point. By the inclusion-exclusion principle, 
a 4-sided range counting query can be expressed as a linear combination of four two-sided range 
counting queries, so a data structure that answers two-sided queries with error en/4 can be used 
to answer 4-sided queries with error en. 

Our data structure is based an e-net for orthogonal rectangles. For a range space (P,7Z), a 
subset A C P is called an e-net of P if for any range R € 1Z that satisfies \P f~l R\ > en, there is 
at least 1 point in A n R. Note that an e-approximation is an e-net, but the converse may not be 
true. 

The size of the smallest e-net for orthogonal rectangles in 2 dimensions has been recently settled. 
Aronv, Ezra and Sharir [2] gave an algorithm for constructing an e-net of size O(^loglogi) for 
any P, while a matching lower bound was given by Pach and Tardos [15]. 

The data structure. Consider a set P of n points on an n x n grid. We first construct an e-net P £ 
of P for orthogonal rectangles. Let m = c\ log log ^ denote the size of this e-net, for some constant 
c. For simplicity we assume m is a power of 2. Let {ui = (xi, yi) \ i = 1, . . . , m} denote the points 
in P £ , in which the points are sorted on their x-coordinates. We call a point p € P an e-net point 
if p € P e . 

The key ingredient of our structure is a bit array attached to each e-net point, which can be 
used to determine whether or not to include this point in our estimation during the query process. 
More precisely, After constructing an e-net P £ , we will associate a bit array Cj of logm bits to each 
point Ui G P £ . The Z-th bit of Cj, denoted C,(Z), is called the l-indicator of U{. To construct Cj, we 
need the following definition of dyadic rectangles: 

Definition 2.1. For 1 < I < logm, we define dyadic rectangles at level I to be rectangles of the form 
[x a2 i , X( a+1 ) 2 i ) x [0, y) , for a = 0, . . . , n/2 l — 1 and y > 0. We use R v a l to denote the dyadic rectangle 
[x a2 i , X( a+1 ) 2 ; ) x [0, y) . In particular, we use R^ denote the dyadic rectangle [x a2 i , X( a +i)2' ) x [0' °°) • 

For 1 < I < logm, the /-indicators of the e-net points are determined as follow. Let R^ = 
[x a2 i , X( a+1 ) 2 x [0, oo) be a dyadic rectangle at level /. We use P a ,i = {pi, ■ ■ ■ ,p s } to denote the 
set of points of P in R^i, in which the points are sorted in their ^-coordinates. We divide P a j into 
chunks of size en. For each chunk, we find the lowest e-net point (there must exist one due to the 
e-net property, except possibly the highest chunk if it contains less than en points in P) and set its 
/-indicator to 1. The /-indicators of the other e-net points in P a> ; are set to 0. See Figure 1. Then 
we do so for each dyadic rectangle R^ and for all 1 < I < logm. 

Query process. Given a query point q = (q x ,Qy), we initialize a counter Q = 0. Let U{ q be the 
e-net point such that Xi q < q x < Xi q+ \. We note that the range [0, Xi q ) x [0, q y ) can be decomposed 
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Figure 1: Illustration of the construction of the /-indicators. The solid dots represent e-net points, 
and the crosses represent e-net points with /-indicators set to 1. 



into at most logm dyadic rectangles, and we use lZ q denote the set of these dyadic rectangles. For 
each dyadic rectangle R^ t G 1Z q and e-net point Ui G i?^, we add en to the counter Q if Cj(Z) = 1. 
After all dyadic rectangles in 1Z have been processed, we have 

Q= E E 

Then the data structure returns Q as the approximated count. Since the focus of the paper is 
on the space complexity, we have ignored query time. In fact, by precomputing some appropriate 
prefix counts on the Cj(Z)'s, Q can be computed in 0(log ^ + log logn) time; we omit the details to 
avoid digression. 

Analysis. We now analyze the error of the above estimation. We claim that Q estimates the 
number of points of P inside [0, q x ) x [0, q y ) with error at most (log m + l)en. The following lemma 
gives an upper bound of the error in a single dyadic rectangle. 

Lemma 2.1. Given a dyadic rectangle R v a i, The following inequality holds: 

0< E en-d{l)- PC\R y al <en. 

uieP e nfl», 

Proof. Recall that we divided the points in PDR^ into chunks of size en, and set the /-indicator of 
the lowest e-net point to be 1. Thus, each such e-net point represents exactly en points in P{~\R v aV 
except the highest one, which may represent less than en points. The lemma then follows. □ 

Since there are at most logm dyadic rectangles in TZ q , Q is an estimator of \P n [0, UjJ x [0, q y )\ 
with error at most en logm. We also note that there is no e-net point in (ui q ,q x ) x [0,^), so the 
error from this rectangle is at most en. This proves that Q estimates \P C\ [0, q x ) x [0, q y )\ with 
error (logm + l)en = (log (cX log log ^) + l)en. It is easy to see that our data structure uses 
0(i log log i logn) bits, which is dominated by the coordinates of the e-net points. By using 
e' = e/(log(ci log log ^) + 1) in place of e in the construction, we obtain a data structure that uses 
0(h log h log log A logn) bits and answers two-sided range counting queries with error e'n. 
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Theorem 2.1. Given a set of n points drawn from an n x n grid, there is a data structure that 
uses log j log log j logn) bits and answers orthogonal range counting query with additive error 
en. 

3 Lower Bound 

As argued in the introduction, our lower bound follows from the following theorem. 

Theorem 3.1. Let V denote the collection of all n-point sets drawn from an n x n grid. There 
exists a constant c and a sub- collection V* C V of size 2 f2 ( nlo s ri ) ) such that for any P\,P2 € V* , 
disc(Pi UP 2 ,K 2 ) > clogn. 

In the rest of this section, we will focus on proving Theorem 3.1. We will use the a collection 
of point sets called binary nets as a ground set to derive the point sets that satisfy the conditions 
in Theorem 3.1. Binary nets are a special type of point sets under a more general concept called 
(t,m, s)-nets, which are introduced in [14] as an example of point sets with low Lebesgue discrep- 
ancy. See the survey by Clayman et. al. [9] or the book by Hellekalek et. al. [12] for more results 
on (t, m, s)-nets. In this paper we will show that binary nets have two other nice properties: 1) A 
binary net has high combinatorial discrepancy, i.e., f2(logn); 2) there is a bit vector representation 
of every binary net, which allows us to extract a sub-collection by constructing a subset of bit 
vectors. In the following sections, we will define binary nets, and formalize these two properties. 

3.1 Definitions 

For ease of the presentation, we assume that the n x n grid is embedded in the square [0,n) 2 . We 
partition [0, n) 2 into n x n unit squares. We assume the grid points are placed at the mass centers 
of the n 2 unit squares, that is, each grid point has coordinates (i + 1/2, j + 1/2), for i, j <G [n], where 
[n] denote the set of all integers in [0, n). For the sake of simplicity, we define the grid point 
to be the grid point with coordinates (i + 1/2, j + 1/2), and we do not distinguish a grid point and 
the square it resides in. 

Now we introduce the concepts of (a, b)- cell and k-canonical cell. 

Definition 3.1. A (a, 6)-cell at position is the rectangle [i2 a , (i + l)2 a ) x [j2 b , (j + l)2 b ). We 
use G a> b(i,j) to denote the (a,b)-cell at position and G a ,b to denote the set of all (a, b)- cells. 

Definition 3.2. A ^-canonical cell at position is a (A;, logn — k)-cell with coordinates (i, j). 
We use Gk(i,j), to denote the k-canonical cell at position and Gk to denote the set of all 

k-canonical cells. 

Figure 2 is the illustration of (a, 6)-cells and canonical cells. Note that the position for a 
(a, 6)-cell takes value in [n/2 a ] x [n/2 b ]. In particular, we call Go(i, 0) the i-th column and Gi O gn(0, j) 
the j'-th row. Note that for a fixed k, G^ partitions the grid [0, n) 2 into n rectangles. Based on the 
definition of /c-canonical cells, we define the binary nets: 

Definition 3.3. A point set P is called a binary net if for any k € [logn], P has exactly one point 
in each k-canonical cell. 
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G a t,(3, 3) 2° 



2'' 



Go G\ 



Number of (a, 6)-cells: n/2 a G 2 G 3 

Figure 2: Illustrations of (a, 6)-cells and canonical cells. 

Let Vq denote the collection of binary nets. In other word, Vq is the set 

{P\ \PnG k (i,j)\ =l,k€ [\ogn],ie [n/2 k ],je [2 k ]}. 

It is known that the point sets in Vq have Lebesgue discrepancy O(logn); below we show that 
they also have O(logn) combinatorial discrepancy. However, the union of two point sets in Vq could 
have combinatorial discrepancy as low as 0(1). Thus we need to carefully extract a subset from 
Vo with high pairwise union discrepancy. 

3.2 Combinatorial Discrepancy and Corner Volume 

We first prove that the combinatorial discrepancy of any point set in Vq is large. To show this, we 
need the following definition of corner volume: 

Definition 3.4. For a point set P £ Vo and a k-canonical cell Gk(i,j), let q be the point in 
P fl Gk(i,j). Define the corner volume Vp(k,i,j) to be the volume of the orthogonal rectangle 
defined by q and its nearest corner of Gk(i,j). We use Sp to denote the summation of the corner 
volumes over all possible triples (k,i,j), that is, 

logn n/2 fc -l 2 fc -l 
fc=0 i=0 3=0 

See Figure 3 for the illustration of corner volumes. The following lemma relates the combina- 
torial discrepancy of P with its corner volume sum Sp. 

Lemma 3.1. There exists a constant c, such that for any point set P G Vq that satisfies 

Sp > cn 2 log n, 

we have disc(P, IZ2) = fi(logra). 

By Lemma 3.1, we can prove that the combinatorial discrepancy of a binary net is large by 
showing its corner volume sum is large. In particular, we have the following lemma: 



n/2\ 
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Figure 3: Illustration of the corner volume and the four analogous points. The area in shadow 
represents the corner volume Vp(k,i,j). 



Lemma 3.2. For any point set P € Vo, we have disc(P, 7^2) = fi(logn). 

Strictly speaking, Theorem 3.1 does not depend on Lemma 3.2 and Lemma 3.1, but these 
lemmas give us some insight on the binary nets. Moreover, a key lemma to proving Theorem 3.1 
(Lemma 3.4) shares essentially the same proof with Lemma 3.1. We defer the proof of Lemma 3.1 
and Lemma 3.2 to the Appendix in order to avoid digression into technicalities. 

3.3 A bit vector representation for Vq 

Another nice property of Vq is that we can derive the exact number of point sets in it. The following 
lemma is from the book [10]: 

Lemma 3.3 ([10]). The number of point sets in V is 2^ nlogn . 

We sketch the proof of Lemma 3.3 here, as it provides a bit vector presentation of each binary 
net, which is essential in our lower bound proof. 

Proof of Lemma 3.3. It is equivalent to prove that the number of possible ways to place n points 
on the n x n grid such that any /c-canonical cell Gk(i,j) has exactly 1 point is 22 nlo s n . We prove 
it by induction on n. Let Vo(n) denote the collection of binary nets of size n in a n x n grid. 

Observe that the line y = n/2 divides the grid [0,n) 2 into two rectangles: the upper grid 
[0, n) x [n/2, n) and the lower grid [0, n) x [0, n/2). For i even, let Ri denote the rectangle defined 
by the union of i-th and (i + l)-th columns Gq(i, 0) and Gq(i + 1,0). Note that the line y = n/2 
divides Ri into Gi(i/2,0) and Gi(z/2, 1), and therefore defines four quadrants. By the definition 
of Vo, for any point set P G Vq, the two points in Go(i,0) and G\{i + 1,0) must either reside in 
the lower left and upper right quadrants or in the lower right and upper left quadrants. There 
are in total n/2 even i's, so the number of the possible choices is 2 n / 2 . See Figure 4. Note that 
after determining which half the point in each column resides in, the problem is divided into two 
sub-problems: counting the number of possible ways to place n/2 points in the upper grid and the 
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n points in a n x n grid 



| points in | x | grid 



J V 
G„(0,0) G (1,0) 



G (n- 2,0) G o (n-l,0) 



Zp(0,0,0) = j J * ■■■ Z P (0,n/2-l,0) = { J * 



Figure 4: Illustration of the partition vector of Gq. 



lower grid. It is easy to show that each sub-problem is identical to the problem of counting the 
number of point sets in Vo(n/2), so we have the following recursion: 

|7>o(n)| = 2? -|PoW2)| 2 . 

Solving this recursion with Vq{1) = 1 yields that \V {n)\ = 25 nlo § n . □ 

A critical observation is that the proof of Lemma 3.3 actually reveals a bit vector representation 
for each of the point sets in Vq, which will allow us to refine the collection Vq. To see this, we define 
the partition vector Tip for a point set P € Vq as follows. For any (k, i,j) 6 [logn] x [n/2 fc+1 ] x [2 k ], 
consider the /c-canonical cells Gk(2i,j) and Gk{2i + l,j) and [k + l)-canonical cells Gk+i(i,2j) 
and Gk+i(i,2j + 1). The two fc-canonical cells overlap with the two (k + l)-canonical cells, which 
defines four quadrants. By the definition of binary nets, there are two points in P contained in 
these quadrants. We define Zp(k,i,j) = if the two points are in the lower left and upper right 
quadrants and Zp(k,i,j) = 1 if they are in the lower right and upper left quadrants. See Figure 4. 
We say the /c-canonical cells Gk(2i,j) and Gk(2i + is associated with bit Zp(k,i,j). Note that 
we use the triple (k, i,j) as the index into Zp for the ease of presentation; we can assume that the 
bits in Zp are stored in for example the lexicographic order of (k,i,j). Since the number of triples 
(k,i,j) is ^nlogn, the total number of bits in Zp is ^nlogn. Let Zq = {0, i^2 nlo ^ n denote the set 
of all possible partition vector Zp's. By the proof of Lemma 3.3, there is a bijection between Zq 
and Vq. 

3.4 Combinatorial discrepancy and corner volume distance 

Although we have proved that binary nets have large combinatorial discrepancy, it does not yet 
lead us to Theorem 3.1. In this section, we will refine Vq, the collection of all binary nets, to derive 
a collection V* , such that the union of any two point sets in V* has large combinatorial discrepancy. 
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In order to characterize the combinatorial discrepancy of the union of two point sets, we will need 
the following definition of corner volume distance. 

Definition 3.5. For two point sets P\,P2 £ Pq, the corner volume distance of P\ and P2 is the 

summation of \Vp 1 (k, — Vp 2 (k,i, over all (k,i,j). In other words, let A(Pi,P2) denote the 
corner volume distance of Pi and P2, then 

log n n/2 k -l 2 k -l 

A(Pi,P 2 ) = £ Yl J2\V Pl (k,i,j)-V P2 (k,i,j)\. 

k=0 i=0 j=0 

The following lemma (proof in the Appendix) relates the combinatorial discrepancy of the union 
of two point sets with their corner volume distance: 

Lemma 3.4. Let V* be a subset ofVo- If there exists a constant c, such that for any two point 
sets P\,P2 £ Vq, that their corner volume distance satisfies A(Pi,P2) > cn 2 logn, then disc(Pi U 
P2,n 2 ) = fi(logn). 

Here we briefly explain the high level idea for proving Theorem 3.1. By Lemma 3.4, it is 
sufficient to find a sub-collection V* C Pq, such that for any two point sets in V*, their corner 
volume distance is large. We will choose a subset Z\ C 2 0) and project each vector in Z\ down to a 
slightly shorter bit vector T. The collection T of all resulted bit vector T's induces a sub-collection 
Pi f= Pq, and each T represents a point set in V\. Then we prove that for any two point sets 
P±,P2 £ Pi, there is a linear dependence between the corner volume distance A(P±,P2) and the 
hamming distance of their bit vector representations T p 1 and T p 2 . Finally, we show that there is 
a large sub-collection of T with large pair- wise hamming distances, and this sub-collection induces 
a collection of point sets P* € P\ in which the union of any two point sets has large combinatorial 
discrepancy. 

We focus on a (k + 6, log n — fc)-cell Gk+6,iogn-k(i, j), for k € {0, 6, 12, ... , logn — 6}. Note that 
Gk+6,\ogn-k{h j) on ly contains (k + Z)-canonical cells for I £ [7]. Let Fk,i,j(l) denote the set of all 
(k + Z)-canonical cells in Gk+&,\ogn-k{h j), it is easy to see that 

F k ^ j ( l ) = {Gk+i(2 (i - l i + s,2 l j + t) \se [2 6 1,fe [2']}. 

Note that \Fk,i,j(l)\ = 64 for each I € [7]. Let Zk,ij(l) denote the set of indices of bits in the 
partition vector that are associated with the some (k + Z)-canonical cells in Gk+6,iogn-k(i, j), for 
I £ [6], i.e., 

Z k , id (l) = {(k + I, 2 5 ~ l i + s, 2 l j + t)\se [2 5 ~ l ],t G [2']}. 

Define Z^^j to be the union of the j j(Z)'s. Since there are 32 bits in Z^^j^l) for each I € [6], the 
total number of bits in Z^ij is 192 (here we use the indices in Z^ij to denote their corresponding 
bits in the partition vector of P, with a slightly abuse of notation). The following fact shows the 
Zk,i,/s partition all the ^nlogn bits: 

Fact 3.1. The number of Zk % j 's is log n; For different {k, i, j) and (k', Z^i jf\ Zy y y = 

0. ' 

The proof of the above claims are fairly straightforward: The number of different Zfc,ij' s is equal 
to the number of different Gk+e^iogn-kih i)' s - F° r a fixed k, the number of different (k+6, logn — k)- 
cells is n/64, and the number of different fc's is logn/6, so the total number of different Z^^^s is 
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n/2 k n/2 k+i 




Gk+6,\o g n-k(i,j) G k+3 (8i, 8j) 



Figure 5: Illustration of the 64 x 64 grid. The volume of each cell in Gfc + 3(8i, 8j) is n/64. The cells 
in shadow represent the corner volume difference of s\ and S2- 

g|^nlogn. For the second claim, we consider the following two cases: If k = k', we have(z, j) ^ 
This implies that the two (/c,logn — k + 6)-cells are disjoint, therefore the bits associated 
with the canonical cells inside them are disjoint. For k ^ k' , observe that we choose k and k' from 
{0, 6, . . . , log n — 6}, and Z^^j and Z^/yj/ only contain bits associated with (A; + Z)-canonical cells 
and (k' + /')-canonical cells, respectively, for /,/' G [6], so Zk t ij(l) and Zk'i'j'(l') are disjoint, for 
U'€[6]. 

The reason we group the bits in the partition vector into small subsets is that we can view each 
subset as a partition vector of the cell Gk+ejogn-kih j)> which allows us to manipulate the 

positions of the points inside it. More precisely, we can view Gk+6,iogn-k(i, j) as a 64 x 64 grid, 
with each grid cell being a (k, logn — k — 6)-cell in the original [0, n) 2 grid. Moreover, a (k + l)- 
canonical cell contained in Gfc+6,iogn-fc(^ j) corresponds to a /-canonical cell in the 64 x 64 grid. 
Note that there are 64 points in this grid, and the bits in Z^^j correspond to the partition vector 
of this 64-point set. Now consider a (k + 3)-canonical cell G/ c+ 3(8i, 8j), which corresponds to the 
lower left 8x8 grid in Gk+§,\ogn-k{h ])■ For each point set P G Vq, there is exactly one point in 
Gjfc_|_3(8i, 8j), and the bits in Z^ij encode the position of the point on the 8x8 grid. Suppose s± and 
S2 are two bit vectors of length 192, such that when the bits in Zk t ij are assigned as s± (denoted 
Zk,i,j = s i)> the point in Gfc + 3(8i, 8j) resides in the upper left grid cell,; and when Z^^j = S2, it 
resides in the grid cell to the upper left of the center of Gk+3(8i, 8j) (see Figure 5). Note that 
by this definition, the corner volume distance of this two point is at least n/8. Meanwhile, since 
there are no constraints on the other 63 points in Gk+6 t i gn-k(h j), it is easy to show that such 
assignments s\ and S2 indeed exist. 

By restricting the assignments of Z^^j to {si,^}, we have created a subset Z\ of Zq = 
{0,l}l" lo s™ : 

Zi = {Z\ Z kii>j = si or 82 , /c € {0, 6, ... , log n — 6}, i G [n/2 k+ % j G [2 k ]}. 

Let V\ denote the sub-collection of Vq that Z\ encode. By Fact 3.1, the number of Z^^j's is 
g^nlogn, so \V\\ = 25i4 nlo s n . Define a bit vector T of length ^nlogn, such that T(k,i,j) = 
if Zk t i t j = Si and T(k,i,j) = 1 if Zk,i„j = s%, then a bit vector T encodes a bit vector Z G Z±, and 
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therefore encodes a point set in V\. Let T = {0, l} 384 nlo s™ denote the collection of all bit vectors 
T. Then there is a bijection between T and Vi, and \T\ = \Vi\ = 2sk nlogn . 

Consider two point sets Pi and P 2 in V\. Let Tp x and Tp 2 denote the bit vector that encode 
these two point sets, respectively. The following lemma relates the corner volume distance of Pi 
and P2 with the hamming distance between Tp 1 and Tp 2 . 

Lemma 3.5. Suppose there exists a constant c, such that for any Pi,P 2 G V\, the hamming 
distance H(T > i; T > 2 ) > cn log n, then the corner volume distance between P\ and P 2 , A(Pi,P 2 ), 
is 17 (n 2 logn). 

Proof. We make the following relaxation on A(Pi,P2) : 

logrin/2 fc -l 2 k -l 

A(Pi,P 2 )=^ Yl E \V Pl (k,i,j)-V Pl (k,i,j)\ 

k=0 i=0 j=0 

n / 2 fc+6_ l2 fe_ 1 

^ E E E I (fc + 3, 8i, 8j) - (fc + 3, 8i, 8j)| . 

fce{0,6,...,logn-6} i=0 j=0 

Now consider the bits Tp 1 (k,i, j) and Tp 2 (k,i, j). If Tp 1 (k, i,j) ^ Tp 2 (k,i, j), then by the 
choice of si and S2 we have |Vp 1 (/c + 3, 8i, 8j — Vp 2 {k + 3, 8i, 8j)\ > n/8. So the corner volume 
distance A (Pi, P2) is lower bounded by the hamming distance H(Tp 1 , Tp 2 ) multiplied by n/8, and 
the lemma follows. □ 

The following lemma (probably folklore; see a proof in the Appendix) states that there is a 
large subset of T, in which the vectors are well separated in terms of hamming distance. 

Lemma 3.6. Let N = ^nlogn. There is a subset T* Q T = {0, 1}^ of size 2tg n , such that for 
any Ti 7^ T 2 G T* , the hamming distance PT(Ti,T 2 ) > \N . 

Proof. We embed T into a graph (V,E). Each node in V represents a vector T € T , and there 
is edge between two nodes Ti and T 2 if and only if P"(Ti,T 2 ) < |iV. By this embedding, it is 

equivalent to prove that there is an independent set of size 2ie N in (V, E). 

Fix a vector T G T, and consider a random vector T' uniformly drawn from T ■ It is easy to 
see that the hamming distance H(T, T') follows binomial distribution. By Chernoff bound 

Pr[#(T,T') < < e~Te N < 2~ts n . 

This implies that the probability that there is an edge between T and T' is at most 2~~> N . By the 
fact that T' is uniformly chosen from T, it follows that the degree of T is at most d = 2 N ■ 2~Te N = 
2ie N . Since a graph with maximum degree d must have an independent set of size at least \V \ /d, 
there must be an independent set of size at least 216^. □ 

Let V* denote the collection of point sets encoded by T*. By Lemma 3.6, \V*\ > 2Te N = 
26T34 nl °g ri . From Lemma 3.4 and Lemma 3.5 we know that for any two point sets Pi, / P 2 G V*, 
the combinatorial discrepancy of the union of Pi and P 2 is r2(logn). This completes the proof of 
Theorem 3.1. 
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A Appendix 



A.l Proof of Lemma 3.1 

Proof. The proof makes use of the Roth's orthogonal function method [16], which is widely used 
for proving lower bounds for Lebesgue discrepacy (see [8, 14]). Consider any point set P € Vq that 
satisfies Sp > cn 2 log n. Given any coloring \ '■ P ~ * {~ 1, +1} and a point x = (x\,X2) S [0,n) 2 , 
the combinatorial discrepancy D(x) at a point x is defined to be 

D(x) = Xip)- 

pe?n[o,j:i)x[o,D2) 

If we can prove sup a , e [ n )2 |-D(x)| = Jl(logn), the lemma will follow. 

For k € [logn], define normalized wavelet functions f k as follow: for each /c-canonical cell 
G k (i,j), let q denote the point contained in it. We subdivide G k (i, j) into four equal-size quadrants, 
and use G k (i,j)un, G k (i,j)uL, G k (i,j) LR , G k (i,j) LL to denote the upper right, upper left, lower 
right and lower left quadrants, respectively (See Figure 3). Set f k (x) = x(q)/ n2 over quadrants 
G k (i, j)uR an d G k (i, j)lLi and f k {x) = —x{<l)/ n2 over the other two quadrants. We say a function 
/ : [0, n) 2 — > R is (a, b)- checkered if for each (a,6)-cell G aj b{i,j), there exists a color C G {— 1,+1} 
such that / is equal to C/n 2 over G a ^(i,j)uR and G a ^(i,j)LL-, and f(x) = —C/n 2 over the other 
two quadrants. Note that the wavelet function f k is (k,logn — A;)-checkered. It is easy to see that 
the integration of a (a, 6)-checkered function over an (a, 6)-cell is 0. The following lemma is proved 
in the book [8]: 

Lemma A.l ([8]). /// is (a, b) -checkered and g is (a',b') checkered, where a < a' and b' < b, then 
fg is (a, b') -checkered. 

The following corollary can be directly derived from Lemma A.l: 

Corollary A.l. For < ki < . . . < k\ < logn, the function f kl (x) • • • f kl (x) is a (A;i,logn — ki)- 
checkered. As a consequence, we have 



ff kl 



fh=0. 



Note that the range of the integration is [0, n) 2 and the variable of integration is dx when not 
specified. We define the Riesz product 



G(x) = -l+f[( 1 f k (x) + l), 



k=0 

where 7 is some constant to be determined later. By the inequality 



GD 



< [ \GD\ < sup \D\ ■ f \G\ , 



we can lower bound the combinatorial discrepancy of P as follow: 



sup \D\ > 



\G\. (A.l) 
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For the denominator, we have 



\G\ 



logn 

1+ n(7/fc + i; 

k=0 

logn 



<i + EV E //*! 

/=0 0<fei<...<fe ( <logn 
log rt 

= 2+EV E 



hi 
hi 



1=1 0<fci<...<fe(<logn' 



= 2. 



(A.2) 



The last equation is due to Corollary A.l. 

The numerator | J* G(x)D(x)dx| can be expressed as follow: 



/ 



GD 



. / logn \ 

j (^- i+ n^ +i )j ° 

// logn logn \ 

hE^ + EV E Ai ■ ■ ■ /*. ] ■ -° 

y fc=0 Z=2 0<fci<...<fe ( <logn / 



>7 



logn 



fc=0 



log n 



Z=2 



E Jfk 1 ---fk l D 



0<fci<...<fc;<logn 



(A.3) 



We consider th integration of a single product fk(x)D(x) over a /c-canonical cell Gk(i,j)- Recall 
that q denotes the point in P that lies in Gk(i,j), and %(g) denotes its color. Define vectors 
u = (2 fe_1 ,0) and t> = (0, n/2 k+1 ). Then for any point x G Gk{i,j)LL, points x + u, x + u 
and x + u + v are the analogous points in quadrants Gk(i, j) lr, Gk(i,j)uL and Gk(i, j)jjR of x, 
respectively (see Figure 3). Let R x denote the orthogonal rectangle defined by x and x + u + v , 
and let function R(x) be the indicator function of point q and R x , that is, R(x) = 1 if q £ R x and 
R(x) = if otherwise. We can express the integration as 



/ h(x)D(x)dx= f ^ (D(x) - D(x + u) 

■/<?*(< J) JG k (i,j)LL 71 



L 



D(x + v) + D(x + u + v)) dx 

x(q) 



G k {i,j) L L 



x(q)R(x)dx 



n 2 L 



R(x)dx. 



G k (i,j) L L 

The second equation is because (D(x) — D(x + u) — D{x + v) + D(x + u + v)) only counts points 
inside R x , which can only be q, or nothing otherwise. Observe that R(x) = 1 if and only if one of 
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x's analogous points lies inside the rectangle defined by q and its nearest corner (see Figure 3), so 
we have 



L 



fkD =\\ R 

G k (i,j) n JG k (i,j) LL 



(A.4) 



Now we can compute the first term in (A. 3): 



lognn/2 fc -l 2 fc -l 



E / f* D =1 E E E / 

k=0 J k=0 i=0 j=Q JG k{hj) 



1 



k=0 i=0 j=0 jG k{i,j) 
lognn/2 fc -l 2 fe -l 

E E E >■<*.*.*) 

k=0 i=0 j=0 



n 2 



S P 



> 07 logn. 



(A.5) 



For the second term in (A. 3), recall that the function ■ ■ ■ fk t is logn— ^-checkered. Con- 
sider a (fci, log n — fc/)-cell Gk!,\ogn-k v Note that P intersects G^^ogn-kiih j) with at most 1 point. 
By similar arguments in the proof of equation (A.4) , we find the integration \ r , . /&,■■■ fk, D 

J ^fc^ ,log n — ki\ tij) 

is if P n Gfc^iogn-fc, = and otherwise equal to the corner volume of G^^ihj) divide by n 2 . 
In the latter case, we relax the corner volume to the volume of Gk 1: \ OS n-ki(hj), that is, n/2 hl ~ kl . 
Thus we can estimate the integration as follows: 



Ig, 



fk!'" fk t D 



< 



1 
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1 



n 



Since there are n non-empty (/ci,logn — fc/)-cells, we have 

1 



2 2 k i~ kl 2 kl ~ kl 



n 



fk!'-' fhD 



<n- 



Now we can estimate the second term in (A. 3) : 

logn 



EV 

1=2 



E J ^ ki " ^ h 



D 



0<fci<...<fe i <logn 



log n 

— / j ' / j 2 k i~ k t 

1=2 0<fei<...<fc ; <logn 
logn logn+1 , 

=EV E E 9» w- 



i=2 w=l— 1 ki—k!=w 



1-2 



(A.6) 



For the last equation we replace fc; — fci with a new index iu and use the fact that there are (Y_ 2 1 ) 
ways to choose fo, • • • , fej-i in an interval of length w. Note that for a fixed u>, there are log +1 — w 
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possible values for k±, so 



logn logn+1 , . logn logn+1 , 

Z^'Z^ Z^2 w \/-2/ Z^ ' 2^ V/-2 

Z=2 tu=/-l fc;-fci=tu v 7 Z=2 iu=J-l V 

logn logn+1 , 

logn/^-1 
- Z^ ' Z^ 2 W V / - 2 

Z=2 u)=«-l v 
logn logn+1 



"*»EV E h T: 2 )' < a - 7 » 

Z=2 w=J-l V 7 



By inverting the order of the summation, 

logn logn+1 , v logn+1 ?o+l 



logn^y £ £ ^(7r 2 ) =72logn e ^£(7r 2 ) 

/=2 u)=J-l ki-k 1= w v 7 10=1 Z=2 v 7 

logn+1 

7 2 logn £ + 



=2 7 2 logn ^ 

2 7 2 
1-7 



2 1 

10 = 1 

logn+1 / x w _i 

1+7 



ui=l 

2 



27 2 

<-^— logn. (A.8) 



2 7 2 

> c 7 log n log n. 

1-7 



So from (A.5), (A.6), (A.7) and (A.8) we have 

GD 

Setting 7 small enough while combining with (A.l) and (A. 2) completes the proof. □ 
A. 2 Proof of Lemma 3.2 

Proof. By Lemma 3.1, we only need to prove that the corner volume sum of any point set P G Vo 
is large. Fix k and consider a fc-canonical cell Gk(i,j). Let q denote the point in P n Gk(i,j). We 
define the corner x-distance of Gk(i,j) to be the difference between the x-coordinate of q and that 
of its nearest corner of Gk(i,j). The corner y-distance is defined in similar manner. See Figure 3. 
We use X(k,i,j) and Y(k,i,j) to denote the corner x-distance and corner y-distance, respectively. 
Note that the corner volume Vp(k,i,j) is the product of X(k,i,j) and Y(k,i,j). 

Fact A.l. Fix k audi, {X(k,i,j) | j G [2 fc ]} = {j + 1/2, j + 1/2 | j G [2 k ~ 1 ]}, where both are taken 
as multisets. 

For a proof, note that the /c-canonical cell Gk(i, j) is intersecting with 2 fc columns: Go(i2 fc , 0), . . . , Go 
l)2 fe — 1,0). There are 2 k points in Gk(i,0), . . . ,Gk(i,2 k — 1), and they must reside in different 
columns. Therefore there is exactly one point in the each of the 2 k columns, and their corner 
x-distances span from 1/2 to 2 fc ~ 1 /2 — 1/2, and each value is hit exactly twice. Similarly, we have 
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Fact A.2. Fix k and j , {X(k,i,j) \ ie [n/2 k ]} = {i + 1/2, i + 1/2 \ ie [n/2 k+1 ]}, where both are 
taken as multisets. 

Now consider the product of X(k,i,j) and Y(k,i,j) over all (i,j) for a fixed k: 

n/2 k -l2 k -l n/2 fc -l 2 fc -l 

n nvp{k,i,j)= n [jx(fc,i,i)y(fe,i,j) 

i=0 j=0 i=0 j=0 

n/2 fc -l 2 fc -l 2 fc -l"/2 fc -l 

= n n^-^i' n n 

1=0 j=0 j=0 1=0 

n/2 fc -l 2 fc -ln/2 fc -l 

= n n 0-+V2) 2 - n n (<+v2) 2 . 

i=o j=o j=o i=o 

The last equation is due to Fact A.l and Fact A. 2. By relaxing i + 1/2 and j + 1/2 to (i + l)/2 
and (j + l)/2, we have 

n/2 fc -l 2 fc -l n/2 fe -l 2 fc - 1 -l x . x 2 2 fc -l n/2 fc -l . . 

n n^,u)> n n ffVnn t 

i=o j=o i=o i=o v 7 j=o i=o v 

= n/ nVoV 2 nV(n/2^)!- 2 

i=o v 7 i=o v 

By the inequality x\ > (x/e) x , 

n nnM„)> n V n ((=£-) 

i=0 j=0 i=0 \ v 7 / j=0 \ v 7 / 

"/2 fe -l , fc _ lx 2'= 2*-l . , fc+lN n/2* 

= n (V) ir 



2e / 11 V 2e 

i=0 v 7 j=0 

,fc-l\2 fc -n/2 fc / ^ 2fc+lx n/2'=.2 fe 



2e J V 2e 

/2 fe V /-n/2^" 



\4e / V 4e 
n 



16e 

Using the inequality of geometric means, 



n j2 k -\2 k -\ /n/2 k -l2 k -l s ' '" 

i=o j=o y i=o j=o 

n 2 

> . 

- 16e 

So the corner volume sum Sp = Y^k=o Y^i=o ~* Y^f^ ^(^' j) ^ s l° wer bounded by n 2 logn/16e, 
and the lemma follows. □ 
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A. 3 Proof of Lemma 3.4 

Proof. Note that there are two points of P\ U P2 in each /c-canonical cell Gk(i,j), and we let qi,q2 
denote the two points from Pi and P2, respectively. We will set fk(x) = C/n 2 for quadrants 
Gk(i,j)uR and Gk(i,j)LL and fk(x) = —C/n 2 for the other two quadrants, where C is determined 
as follows: 



Let D(x) be the combinatorial discrepancy at x over P\ U P2. By similar argument in the proof of 
Lemma 3.1, we get 




X(qi) if V Pl (k,i, j) > Vp 2 (k,i,j); 
X(Q2) ifV Pl (k,i,j) < Vp 2 {k,i,j). 




f ^z(V Pl {k,i, j) +Vp 2 (k,i,j)) if x(Qi) = chi(q 2 ); 
"I i\V Pl (k,i,j)-V P2 (k,i,j)\ ifx(?i)^x(? 2 ). 



In either case, 




And the rest of the proof follows the same argument in the proof of Lemma 3.1. 



□ 
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